Multiobjective Reinforcement Learning Using Adaptive Dynamic Programming And Reservoir Computing

نویسندگان

Mohamed Oubbati

Timo Oess

Christian Fischer

چکیده

This paper introduces a multiobjective reinforcement learning approach which is suitable for large state and action spaces. The approach is based on actorcritic design and reservoir computing. A single reservoir estimates several utilities simultaneously and provides their gradients that are required for the actor enabling an agent to adapt its behavior in presence of several sources of rewards. We describe the approach in theoretical terms, supported by simulation results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

This paper, presents an adapted serious game for rating social ability in children with autism spectrum disorder (ASD). The required measurements are obtained by challenges of the proposed serious game. The proposed serious game uses reinforcement learning concepts for being adaptive. It is based on fuzzy logic to evaluate the social ability level of the children with ASD. The game adapts itsel...

متن کامل

Mini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism

This paper develops an adaptive control method for controlling frequency and voltage of an islanded mini/micro grid (M/µG) using reinforcement learning method. Reinforcement learning (RL) is one of the branches of the machine learning, which is the main solution method of Markov decision process (MDPs). Among the several solution methods of RL, the Q-learning method is used for solving RL in th...

متن کامل

Nested algorithms for optimal reservoir operation and their embedding in a decision support platform

This is a PhD thesis of Blagoj Delipetrev explaining nested dynamic programming, nested stochastic dynamic programming and nested reinforcement learning algorithms that are applied in reservoir optimization problem. Additionally there are also multi-objective version of these algorithms.

متن کامل

Multiobjective Reinforcement Learning for Traffic Signal Control Using Vehicular Ad Hoc Network

We propose a newmultiobjective control algorithm based on reinforcement learning for urban traffic signal control, namedmultiRL. A multiagent structure is used to describe the traffic system. A vehicular ad hoc network is used for the data exchange among agents. A reinforcement learning algorithm is applied to predict the overall value of the optimization objective given vehicles’ states. The p...

متن کامل

Learning Through Interaction

Reinforcement learning is an approach for learning optimal action policy via experiencing, i.e. using observed reward in environment states. Reinforcement learning algorithms include adaptive dynamic programming, temporal difference learning and Q-learning[1]. Examples of successful applications of reinforcement learning are controller for sustained inverted flight on an autonomous helicopter [...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Multiobjective Reinforcement Learning Using Adaptive Dynamic Programming And Reservoir Computing

نویسندگان

چکیده

منابع مشابه

An Adaptive Learning Game for Autistic Children using Reinforcement Learning and Fuzzy Logic

Mini/Micro-Grid Adaptive Voltage and Frequency Stability Enhancement Using Q-learning Mechanism

Nested algorithms for optimal reservoir operation and their embedding in a decision support platform

Multiobjective Reinforcement Learning for Traffic Signal Control Using Vehicular Ad Hoc Network

Learning Through Interaction

عنوان ژورنال:

اشتراک گذاری